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Recent advances have resulted in queue-based algorithms for 
medium access control which operate in a distributed fash- 
ion, and yet achieve the optimal throughput performance 
of centralized scheduling algorithms. However, fundamental 
performance bounds reveal that the "cautious" activation 
rules involved in establishing throughput optimality tend to 
produce extremely large delays, typically growing exponen- 
tially in 1/(1 — p), with p the load of the system, in contrast 
to the usual linear growth. 

Motivated by that issue, we explore to what extent more 
"aggressive" schemes can improve the delay performance. 
Our main finding is that aggressive activation rules induce 
a lingering effect, where individual nodes retain possession 
of a shared resource for excessive lengths of time even while 
a majority of other nodes idle. Using central limit theorem 
type arguments, we prove that the idleness induced by the 
lingering effect may cause the delays to grow with 1/(1 — p) 
at a quadratic rate. To the best of our knowledge, these 
are the first mathematical results illuminating the lingering 
effect and quantifying the performance impact. 

In addition extensive simulation experiments are conducted 
to illustrate and validate the various analytical results. 
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G.3 [Probability and Statistics]: Markov processes 
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As networks continue to grow in size and complexity, they 
increasingly rely on scheduling algorithms for efficient alloca- 
tion of shared resources and arbitration between users. As a 
result, the design and analysis of scheduling algorithms for 
complex network scenarios has attracted significant atten- 
tion over the last several years. 

One of the centerpieces in the scheduling literature is the 
celebrated Max Weight algorithm as proposed in the seminal 
work [201 121] . The Max Weight algorithm provides through- 
put optimality and maximum queue stability in a variety 
of scenarios, and has emerged as a powerful paradigm in 
cross-layer control and resource allocation problems [B]. 

While not strictly optimal in terms of delay performance, 
Max Weight algorithms do achieve so-called equivalent work- 
load minimization and offer favorable scaling characteristics 
in heavy-traffic conditions |15l I18| . As a further key appeal- 
ing feature, Max Weight algorithms only need information 
on the queue lengths and instantaneous service rates, and 
do not rely on any explicit knowledge of the underlying sys- 
tem parameters. 

On the downside, solving the maximum- weight problem 
tends to be challenging and potentially NP-hard. This is ex- 
acerbated in a network setting, where a centralized control 
entity may be lacking or require global state information, 
creating a substantial communication overhead in addition 
to the computational burden. This concern is especially per- 
tinent as the maximum-weight problem needs to be solved 
at a high pace, commensurate with the fast time scale on 
which scheduling algorithms typically need to operate. 

This issue has provided a strong impetus for devising 
algorithms that entail lower computational complexity and 
communication overhead, but (partly) retain the maximum 
stability and throughput guarantees of the MaxWeight algo- 
rithm. Various approaches in that direction were proposed 
in HI [TH [TH [171 [SI El] . An exciting breakthrough in this 
quest was recently achieved in the design of random back- 
off schemes for wireless medium access control that seem 
to offer the best of both worlds. These schemes operate 
in a distributed fashion, requiring no centralized control en- 
tity or global state information, and yet, rather remarkably, 
provide the capability of achieving throughput optimality and 
maximum stability. More specifically, clever algorithms have 
been developed for finding the back-off rates that yield any 
given target throughput vector in the capacity region or that 
optimize a certain concave throughput utility function in 
scenarios with saturated buffers [3 1101 111] . In the same 
spirit, powerful algorithms have been devised for adapting 
the back-off rates based on queue length information, and 



been shown to guarantee maximum stability [8l 1131 [14] . 

While the maximum-stability guarantees for the above- 
mentioned algorithms have strong appeal, they do not ex- 
tend to performance metrics such as expected queue lengths 
or delays. In fact, fundamental performance bounds [l] indi- 
cate that the "cautious" back-off functions involved in estab- 
lishing maximum stability tend to produce extremely large 
delays, typically growing exponentially in 1/(1 — p), with p 
the load of the system, in contrast to the usual linear growth. 
More specifically, the bounds show that the expected queue 
lengths grow as — P) as P t 1- Here -i/> -1 (-) represents 

the inverse of the (decreasing) function ^(-), specifying the 
probability of a node entering a back-off as a function of 
its current queue length. The bounds may be explained by 
noting that the queue lengths govern the fraction of back-off 
time through the function ^(-). Since the fraction of back- 
off time cannot exceed the surplus capacity in order for the 
system to be stable, however, it is ultimately the amount 
of surplus 1 — p that dictates the queue lengths through 
the function ip~ 1 (-). We note that maximum stability has 
been established under the condition that the function ip(q) 
decays (no faster than) inverse-logarithmically as q — > oo. 
This entails that i/j -1 (s) grows (no slower than) exponen- 
tially in 1/s as s 4- 0, yielding the stated exponential growth 

of V _1 (l-/o) in 1/(1- p) as pt 1- 

The above lower bounds suggest that the delay perfor- 
mance may be improved when the function ip(-) decays faster, 
e.g., inverse-polynomially: ?/<((/) ~ q~^ , with /? > 0, so that 
tp~ 1 (s) ~ s~ 1///3 as s 4- 0. The larger the value of the expo- 
nent /3, the slower the growth of i/> _1 (1 — p) ~ (1 — p)~ 1/ '' 3 as 
p f 1. In particular, it might seem plausible that for /3 > 1, 
the expected queue lengths will only exhibit the usual linear 
growth in 1/(1 — p) as p f 1. Note that a larger value of /? 
means that a node is more "aggressive", in the sense that it 
is less likely to enter a back-off and more inclined to hold on 
to the medium, and hence the coefficient /3 will be referred 
to as the aggressiveness parameter. It is worth observing 
that maximum stability for the above back-off functions is 
not guaranteed by existing results, which do not apply for 
any ft > 0. In fact, for j3 > 1, maximum stability has been 
shown not to hold in certain topologies [7J. 

In the present paper we aim to gain fundamental insight 
whether a larger aggressiveness parameter can improve the 
delay performance. Our main contributions are twofold: we 
reveal a lingering effect that arises for large values of /3, e.g. 
f3 > 1, and we quantitatively examine the impact on perfor- 
mance in terms of queue lengths and delays in a simple, yet 
representative scenario. To the best of our knowledge, the 
present paper is the first to expose the lingering effect and 
quantify its performance impact. 

Let us present an illustrative example. Suppose we have 
a group of R nodes that can all be active concurrently, but 
cannot be active simultaneously with any other node in the 
network, and assume that these R nodes activate with initial 
queue lengths of the same order, say Q. When j3 is large, 
it turns out that nodes retain the shared resource until the 
time T* at which the last among the group of R nodes has 
emptied, thus preventing other nodes from activating until 
this time. The law of large numbers suggests that T* is of 
order Q, but the central limit theorem suggests that up to 
time T* , some of the nodes of the group, while waiting for 
the last node to empty, will be empty a fraction y[Q of the 



time. This lingering effect is illustrated in Figure [T] which 
will explained in greater detail in Section [4] 
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Figure 1: A sample path realization of the 2x2 net- 
work analyzed in later sections. The three boxes 
zoom in to show the lingering effect. One queue 
hovers around zero while the other queue is yet to 
empty, resulting in an efficient use of the resource. 

This y^-fraction of the time the shared resource is used 
nefficiently may at first sight seem negligible, and indeed 
queues seem to empty at the same time on the coarse time 
scale of Figure [1] However, we will actually establish that 
it has a significant impact in heavy traffic, causing queue 
lengths to grow at a rate 1/(1 — p) 2 asptl, instead of the 
optimal 1/(1 — p). 

In order to prove this scaling result, we focus on a spe- 
cific setting, namely a symmetric system with two groups of 
R nodes each. Only nodes within the same group can be ac- 
tivated simultaneously. While admittedly simple, the model 
is sufficiently rich to capture the essential features of more 
complex networks, namely a combination of parallel service 
of certain queues and alternating service between conflicting 
queues. Indeed, the above-described heuristic arguments in- 
dicate that the lingering effect can rear its head and cause 
the delays to grow at a rate 1/(1 — p) 2 in a far broader range 
of scenarios. 

The remainder of the paper is organized as follows. In 
Section [2] we present a detailed model description. In Sec- 
tion [3] we provide an overview of the results and discuss the 
main performance implications. We conduct comprehensive 
simulation experiments to illustrate the various heavy-traffic 
results in Section 2] In Section [S] we present a detailed 
asymptotic analysis and proof arguments. Section[6]provides 
concluding remarks and suggestions for further research. 

2. MODEL DESCRIPTION 
2.1 Informal description 

As mentioned in the introduction, in order to analyze the 
lingering effect in a specific setting, we focus on a symmetric 
system consisting of two groups of R > 2 queues. At any 
given point in time, one of the groups is active while the 
other is inactive. 

Time is slotted and inactive queues have simple dynamics, 
driven by independent and identically distributed numbers 
of packet arrivals in each slot, so that they each simply grow 
according to random walks with step distribution denoted 



by £. During each time slot, active queues increase by in- 
dependent amounts distributed as £ as well, but, if at least 
one packet is present at the start of the slot, an active queue 
also flushes exactly one packet. 

Moreover, at the end of each time slot, each active queue 
tosses a coin and advertizes a momentary release with prob- 
ability tp(a), with a the number of packets in the queue at 
the end of the time slot. This (momentary) release gives in- 
active queues an opportunity to become active: if all the ac- 
tive queues simultaneously advertize a release, then inactive 
queues become active and active queues become inactive. 
Such a time is called a switching time. We will assume in 
the sequel that ip(a) — (1 + a)' 13 for some parameter /3 > 0, 
called aggressiveness parameter. In particular, ip(a) — > as 
a — > +00, and so active queues are less likely to advertize 
a release when they are highly loaded; this mechanism thus 
gives priority to highly loaded queues in a distributed fash- 
ion. There is a cost associated with advertizing a release: 
each time an active queue advertizes a release and is not 
empty, it incurs an additional increase distributed accord- 
ing to some random variable (. 

The above-described model qualitatively resembles the canon- 
ical models for queue-based medium access control mecha- 
nisms. The main difference with continuous-time models 
in the literature is that in our model, back-off periods are 
infinitesimally short (hence, the releases are qualified as mo- 
mentary), and the cost £ represents the number of packets 
that would have arrived during a non-zero back-off period. 
The key advantage of our model is that it avoids the com- 
plication of requiring back-off periods to overlap to define a 
switching time. Moreover, simulation experiments show that 
our main results extend to models with non-infinitesimal 
back-off periods. We did not include these simulations to 
comply with page limitations. 

2.2 Parameters 

The model described in the previous subsection is defined 
through four parameters: the number R > 2 of queues in 
each group, two integer- valued random variables £, £ G N = 
{0, 1, • • •} and a real number (3 G (0, 00], which defines the 
[0, l]-valued sequence (ip(a),a > 0) via ip(a) = (1 + a) - ' 3 
for a > 0, to be understood for /3 = +00 as %p(0) = 1 and 
ip(a) — for a > 0. Note that only the asymptotic behavior 
of ip(-) matters, and our results could easily be extended to 
any ?/>(•) with a^ip{a) — ¥ i £ (0, 00) as a — > +00. 
We assume that £ and £ have finite means, respectively E£ = 
p/2 and E£ = z, and that £ has finite variance denoted by 
v = E[(£ — p/2) 2 ]. It will be argued that the system is stable 
if and only if p < 1 and so we will refer to p as the load of 
the system. 

2.3 Formal description 

Because of the symmetry of the system, we do not need to 
label the queues individually, but only need to keep track of 
the state of active and inactive queues. We will consider the 
system embedded at switching times, and define Qr(k) and 
Ql (k) as the numbers of packets in the rth active and inac- 
tive queue, respectively, just after the kth switch occurred. 
We will be interested in the Markov chain (Q(fc),fc > 0) 
which we also write as Q = (Q a , Q*) with Q a/i = {Qr'\ r G 
72), where in the sequel 72 — {1, . . . , R} and we reserve bold 
notation for vectors (of functions or numbers). 

As informally described in Section 12.11 the dynamics of 



Q in between two switching times are governed by two R- 
dimensional processes S = (S r , r £ 72.) and A = (A r , r € 72.): 
S gives the increments of the inactive queues, while A gives 
the state of the active queues. The dynamics are as follows: 

• the 2R processes A r ,S r are independent; 

• for each r G 72, (S r (k), k > 0) is a random walk with 
step distribution £ and started at 0; 

• for each r G 72, (A r (k), k > 0) is a space- inhomogeneous 
random walk with the following dynamics: for any 
a G N and any function / : N — > [0, 00), we have 

E[/(Ar(l)) I Ar(0) = a] 

= E [/ (Y(a) + Cl{Y(a)>0,U<,p(Y(a))})] , (1) 

where Y (a) = a+£— l{ a >o} i s the number of packets at 
the end of the time slot and U is uniformly distributed 
in [0, 1], and U, £ and £ are independent. 

The equation fT} describes the dynamics of an active queue 
and can be interpreted as follows. At each time slot, an ac- 
tive queue increases by £ and, if not empty at the beginning 
of the time slot, flushes a packet, which brings the queue to 
state Y(a). If U < ip(Y(a)), we say that the active queue 
advertizes a release, which thus happens with the (condi- 
tional) probability ip(Y(a)) as prescribed in Section [2.11 If 
at the end of the time slot the queue is not empty and it 
advertizes a release, i.e., Y(a) > and U < tj)(Y(a)), then 
the active queue also incurs an additional increase by 

To define the 27?-dimensional process Q = (Q a , Q 1 ) from 
S and A, it remains to adopt notation for the switching time, 
which we denote by T*. Thus T* is the first time at which 
all active queues advertize a release at the same time. Note 
that T* and S are independent. With these definitions, the 
dynamics of Q as informally described in Section 12.11 obey 
the following equation: for any q = (q°, q 1 ) G N fl x N fl and 
any function / : N 2i * -* [0, oo), 

E[/(Q(1)) | Q(0)=q] 

= E [/(q* + S(T*), A(T*)) I A(0) = q a ] . (2) 

The special case j3 = +<x will be of particular impor- 
tance. Indeed, we will show that it is representative of the 
system's behavior in the range /3 > 1. The case f3 = +oo 
has been studied in [4], who refer to it as the random cap- 
ture algorithm. When /3 — +oo, active queues only adver- 
tize a release when they are empty, and so A(T*) = and 
T* = inf {k > : A(0) = 0}. 

2.4 Additional notation 

In the remainder of the paper, and similarly as we have 
just done in (0, we will use the common symbol E to denote 
expectation with respect to the laws of Q and (A,S). Ini- 
tial conditions will be denoted by a subscript, and it should 
always be clear from the context whether we consider initial 
conditions of Q, A or some A r (remember that S(0) is al- 
ways equal to 0). For instance, |T]) and @ can be rewritten 
as follows: 

E a [/Wl))] = E [/ (Y(a) + a m «)>o,E/<^(Y( ))})] , 
and 

E q [/(Q(l))] = Eqa [/(q< + S(T*), A(T*))] . 



The probability distributions corresponding to these various 
expectations are written as F a , P, P q and P q o. We also 
define P^ with corresponding expectation Eoo as the laws 
of Q and (A, S) started in the stationary distribution of Q, 
provided Q is positive recurrent. 

When /3 = +00, we see based on fl2} that CT(fc) = 0, 
except maybe for k = 0. In particular, when Q is posi- 
tive recurrent, Q l (0) is Poo-almost surely equal to and <(2j) 
therefore becomes 

Eoo[/(Q a (0))] =E oc [/(S(T*))] (0 = +00). (3) 

Further, in the remainder of the paper we let 

t,. = inf{fc > : A r (k) = 0}, r max = maxr r , 

and we define the T( r ) 's as the order statistics of the r r 's, i.e., 

< ■ ■ ■ < T (R) and {T(r)} = {tv}- Let finally |-| be the 
norm and ||-|| be the L\ norm, i.e., if J > 1 and x 6 R J then 
x = maxjl^jl (which is just the absolute value for J = 1) 
and ||x|| — \xi \ + ■ ■ ■ + \xj\. 

2.5 Connection with polling systems 

It is worth emphasizing that we restrict the investigation 
in the next sections to the case of R > 2 queues in each 
group, and exclude the case R — 1 from the analysis. Indeed, 
the lingering effect that we intend to investigate only occurs 
when there are R > 2 queues in the same group. 

The case R — 1 may be interpreted as a single-server two- 
class queueing system, where the server may switch from 
one class to the other after each service completion with a 
probability that depends on the queue length of the class 
that is currently being served. As a somewhat unusual fea- 
ture, the queue length of the latter class increases by the 
random variable £ when a switch occurs. This bears some 
resemblance with a polling system, when £ is viewed as the 
number of arrivals during a switch-over time. It is worth ob- 
serving that the switching rule does not belong to the class 
of branching-type service disciplines which yield tractable 
joint queue length distributions in polling models. The only 
exception is the special case f3 = +00, which corresponds to 
the exhaustive service discipline in polling models (with a 
process-level heavy-traffic analysis in [3]) and the so-called 
random capture algorithm in [3]. In general however, the 
analysis of the joint queue length process appears far from 
trivial, even when £ = 0, although the aggregate queue 
length distribution is then fairly easy to obtain. 

In case R > 2, the system may in the same vein be inter- 
preted as a set of R single-server two-queue polling systems, 
where servers are only allowed to switch between queues 
in a synchronized fashion. Such models with simultaneous 
service of several queues and synchronized switches are nat- 
ural in applications (e.g., traffic lights at intersections), but 
appear not to have been considered. These models show 
similarity with multiple-server polling systems, which did re- 
ceive some attention, but have largely defied analysis. This 
offers testimony of the mathematical complexity of an ex- 
act queueing analysis of the model under consideration, and 
provides justification for an asymptotic investigation. 

3. OVERVIEW OF MAIN RESULTS 
3.1 Two regimes 



As stated in the introduction, we aim to gain fundamen- 
tal insight in the impact of the function ip(-), through the 
aggressiveness parameter /3, on the system performance in 
terms of expected queue lengths and delays. We will demon- 
strate, based on a combination of heuristic arguments, simu- 
lation experiments and theoretical results, that the system's 
behavior changes as (3 increases from to 00. In the small 
P regime, typically j3 < 1/2, the system is characterized 
by a mean-reverting effect that induces (stationary) queue 
lengths of the order of l/(l~p) 1/fi . This regime was already 
hinted upon in T], where a corresponding lower bound was 
rigorously proved in a general setting. In the present paper 
we go one step further, by providing a heuristic argument 
in Section \3. 2 1 explaining why this lower bound is sharp for 
small P, and corroborating this by extensive simulation re- 
sults in Section T4. 21 However, when f3 increases, this mean- 
reverting effect vanishes, which causes the lower bound to 
become loose. For j3 > 1, we show that the system's perfor- 
mance is dominated by another phenomenon, which we call 
a lingering effect. The investigation of this latter effect con- 
stitutes the main contribution of the paper. We provide a 
heuristic explanation of this effect in Section [3]3] examine it 
via simulation experiments in Section [472J and prove various 
theoretical results in Section [5] 

3.2 Small p: a mean-reverting effect 

Consider for a moment a given queue and let A(a) be the 
mean increase of this queue starting from level a. Because of 
symmetry, the queue will be active half the time and inactive 
the other half of the time, so that 

1 1 

A(a) ~ -E a [increase | inactive] + — E a [increase | active] . 

From the dynamics described in Section [53] we get for large 
a (approximating Y(a) « a and neglecting the possibility of 
Y(a) = 0) 2A(a) « Ef + E(£— l + Cl{£/<y(a)}); which leads 
to 

2A(a) w p - 1 + ztp(a). 

This last approximation points to a mean-reverting effect: 
the average drift is negative for a > a* and positive for 
a < a* , where a* is determined by the equation p — 1 + 
zip(a*) = 0, which gives, since ip(a) = (1 + a)" 13 , a* ~ 1/(1 — 
p) 1 ^ ■ Using the convexity of ip(-) and Jensen's inequality, 
it is not difficult to convert the above back-of-the-envelope 
computations into a rigorous proof and show that 1/(1 — 
p) 1///3 is a lower bound for the statoonary mean number of 
packets in the system, as established in [l] in a continuous- 
time setting. 

However, our explanation of this lower bound through this 
mean-reverting effect goes one step further, and shows that 
this lower bound should be sharp for small values of /3. In- 
deed, A (a) is the mean drift obtained by averaging over 
the active and inactive states. This quantity accurately de- 
scribes the drift experienced by a queue if the queue rapidly 
changes states, typically, if the time needed for the queue to 
switch state is negligible compared to n, with n the length 
of the queue. Otherwise, the queue stays in the active or 
inactive state for a long period of time, during which it ex- 
periences (almost) constant drift, either positive or negative. 
This explanation bears some similarity with a random walk 
in a dynamical random environment, where the random walk 
essentially sees a constant environment if the random envi- 
ronment mixes rapidly. 



To conclude our arguments, it remains to note that, at 
least intuitively, queues will switch more rapidly for smaller 
values of f3. We will provide simulation experiments showing 
that the stationary mean number of packets grows as 1/(1 — 
p) 1 /' 3 for small values of ft. As (3 increases however, this 
mean-reverting effect vanishes. The main contribution of 
the paper is to reveal and quantify a previously unknown 
effect, called the lingering effect, which explains the system's 
behavior in this case. 

3.3 Large p: the effect of lingering 

As described in the introduction, when the function ip(-) 
decays sufficiently fast, once a queue gains possession of the 
resource, it holds onto it, even when some or all of the other 
queues in the same group are empty, and it would be more 
efficient for the queues in the other group to receive the 
resource. This causes a lingering effect as illustrated in Fig- 
ure [1] for a scenario with R = 2, f3 = 2. ft may appear that 
the two queues in the same group drain around the same 
time (as can indeed be shown to be the case on a "fluid 
scale"). When we zoom in, however, we see that there is 
actually a time period where one of the queues is already 
empty, while the other one clings to the resource and pre- 
vents the two queues in the other group from activating. 

Let us now provide a somewhat more technical description 
of this phenomenon in the case j3 = +oo, where it is easi- 
est to see thanks to the simplification of the dynamics Q. 
Moreover, we will argue that the case f3 6 (1, +oo) is essen- 
tially a perturbation of the case f3 = +oo. 

By applying © to / = ||-||, we obtain E^ (||Q a (0)||) = 
Eoo (||S(T*)||) and since ||S(-)|| is a random walk with drift 
KEt; independent of T* and A(0) = Q a (0) (by definition), 
we obtain (||A(0)||) = RE^E^ (T*) and so by symme- 
try, 



Eoo (Ai(0))=E(OEoo (T*) 



(4) 



The goal is now to relate E 00 (T*) to E oo (Ai(0)). Remem- 
ber that T* = inf{fc : A(T*) = 0} when (3 = +oo. ft is 



not difficult to show that T* 



essentially because 



once all queues have hit it is only a matter of constant 
time for all queues to be empty simultaneously (this will 
be justified in Lemma 15.11) . Then, the central limit the- 
orem shows that r r » A r (0)/(1 - E£) + A r (0) 1/2 (where 
we neglect constants, possibly random, appearing in front 
of second-order terms), which leads to the approximation 
T max w |A(0)|/(1-E£) + |A(0)| 1/2 . Since under queues 
are symmetric, we have |A(0)| ~ Ai(0) + Ai(0) 1//2 which 
finally leads to T* « Ai(0)/(1 - Ef) + Ai(0) 1/2 , i.e., 

Eoo (Ai(0)) « Y^Ei Eo ° (Al(0)) + Eo ° [ Al (°) 1/2 ] ■ 
Thus upon a concentration-like result of the kind 

E^A^O) 1 ' 2 ] « pMA!(0))] 1/2 
it is reasonable to expect 

E? ^ .{AxiW^lE^AxiQ))} 1 ' 2 . 



1-E£ 

Since 1 - E(£)/(l - E£) w 1 - p this shows that E oo (Ai(0)), 
and hence E oo (||Q(0)||), should grow as 1/(1 - p) 2 . While 
admittedly crude, the above heuristic arguments provide the 
correct estimates, and serve as a useful guide for a rigorous 
proof in Section [5~T1 



As reflected in the above computations, the square factor 
really stems from the relation T* ~ rm + |A(0)| 1,/2 , i.e., 
T* occurs somehow long after rm, the time at which it 
would be optimal to switch in order to avoid inefficient use 
of the resource. But it is difficult to make the system switch 
exactly at Tm in a distributed fashion, and here the penalty 
incurred is a square root. Interestingly, the penalty may 
seem negligible, since it is only a square root, but this small 
inefficiency has a significant impact in heavy traffic. 

We explain in the next subsection how we formalize the 
analysis of this square root effect. 

3.4 Main results 

In this subsection we present our main results, which are 
discussed in the following sections based on a combination of 
rigorous proofs, heuristic arguments and simulation results. 

Our first main result introduces the notion of scaling ex- 
ponent, which essentially determines the polynomial growth 
rate of the mean number of packets in stationarity. 

Main result 1. Q is positive recurrent if p < 1 and 
transient if p > 1. Moreover, the limit 



lin, | lo g E °°(IIQ( )ll) ^ djf. Q 



P n V iog(i/(i-p)) 

exists and is called scaling exponent 



(5) 



The transience of Q when p > 1 is easy to see. Indeed, 
if L(k) is the number of packets at the beginning of the fcth 
time slot, it is clear from the system's dynamics that L is 
lower bounded by a random walk with drift R(p — 1). Thus 
when p > 1, we have L(k) — s> +oo as k — > +oo and since 
(||Q(fc)||, k > 0) is a subsequence of (L(k), k > 0) this proves 
the transience of Q. 

Stability for p < 1 is on the other hand more challenging 
to establish. It is however very intuitive: indeed, if the active 
queues are in state a = (a r ) with a r > for every r, the 
variation of the mean number of packets in the system over 
the next time slot is equal to 



-R 1 



Since tp(a) — > as a — > +oo, this suggests a negative drift, 
close to —R(l — p), when each a r is large enough. The prob- 
lem to formalize this argument is twofold: first, in order 
to prove stability, one must be able to control every pos- 
sible initial configuration, not only those where every a r is 
large (note however that because of symmetry, it is natural 
to expect all a r 's to be large or small simultaneously); sec- 
ond, this argument considers the system on the normal time 
scale, whereas we are interested in the system embedded at 
switching times. 

In Section TS-ll we will prove positive recurrence of Q when 
p < 1 and ft = +oo and explain in Section 15.21 via a per- 
turbation argument why it should hold for any f3 > 1. For 
(3 < 1, the above heuristic argument will be complemented 
by extensive simulation results. 

Our interest in the scaling exponent comes from an ex- 
pected polynomial growth of E oo (||Q(0)||) in heavy traffic. 
In general, we expect as p f 1 a behavior of the kind 



E 



IQ(o)|| 



c 



(1 - p)« 



(6) 



for some finite constant C > (see also Theorem 15.51 where 
a result consistent with ([6]) and stronger than ([5]) is proved). 
The scaling exponent depends on the four model parame- 
ters R, £, ( and /3. However, we will be mostly interested 
in the dependence of a on /J and thus write a(/3) when the 
other three parameters are kept fixed. In fact, our results 
suggest that a does not depend on these other parameters, 
at least in the two extreme regimes we studied in detail. 
As explained in Section [373] we expect the lingering effect 



to make Eoo(||Q(0)||) grow as 1/(1 — p) when j3 = +oo, thus 
predicting a(oo) — 2. We will prove this result, and give 
(perturbation) arguments, backed up by simulation results, 
explaining why it should also hold for /? > 1. 

Main result 2. If /3 > 1, then a{0) = 2. 

It seems quite challenging to prove the existence of and 
find a closed-form expression for a(/3) when /3 < 1. Nonethe- 
less, when /3 4- we gave in Section 13.21 heuristic arguments 
showing that the lower bound ct(fi) > 1//3 of [T] is sharp, 
and we will support this result with extensive simulations. 

Main result 3. /3a(/3) ~ > 1 as j3 — > 0. 

It is an exciting open question whether for intermediate 
values of /3, the mean-reverting and lingering effects can co- 
exist and lead to scaling exponents in between 1//3 and 2. 

4. SIMULATION EXPERIMENTS 

In the previous section we provided an overview of the 
main results characterizing the heavy-traffic behavior of the 
expected queue lengths. Before presenting detailed proof ar- 
guments in the case j3 > 1 in the next section, we first discuss 
comprehensive simulation experiments that we conducted to 
illustrate the stated growth behavior. 

The detailed asymptotic analysis and proofs in the next 
section will reveal that the value of R and the precise distri- 
butions of £ and £ do not affect the stability of the system or 
the value of the scaling exponent. Throughout this section 
we therefore focus on the case where £ is geometrically dis- 
tributed with parameter 2/(2 + p) and P(£ = 1) = 1. We ran 
simulations using different distributions for £ and ( as well, 
including extreme cases such as distributions with infinite 
third moment (even infinite second moment for (). Because 
of page limitations we do not include these cases, but they 
yielded very similar results. 

4.1 Simulations for the main result [1] 




We now examine the case R = /3 = 2 in detail. Figures [2] 
and [3] show the evolution of ||Q(fc)||, starting at Q(0) = 0, 
for p = 0.99 and p = 1.01 respectively. When p = 0.99, 
Figure [2] shows that ||Q(fc)|| fluctuates between 2000 and 
8000 for k large enough, which strongly suggests that Q is 
positive recurrent. 




Figure 3: ||Q(fe)|| vs. k for R = /3 = 2 and p = 1.01. 

When p — 1.01, Figure [3] shows that ||Q(fe)|| increases 
until we stop the simulation when 1.5- 10 packets are present 
in the system. Note that for a transient system we would 
expect that, when the queues are large, the total queue size 
grows by a constant amount on average in every time slot 
and that the time between two consecutive switching times 
increases. This explains the super-linear growth of ||Q(fe)|| 
as we consider the system at switching times. In fact, the 
reasoning in Section T3 .3 1 suggests that when ||Q(fe)|| is large 
we have ||Q(fe + l)||/||Q(fe)]| » Ef/(1 - E£). This gives 
a heuristic explanation for the exponential growth (at rate 
pi (2 — p) > 1) observed in Figure [3] All in all, this suggests 
that Q is transient for p — 1.01. 

Observe moreover that it may in general be difficult to 
distinguish between positive recurrent and transient sys- 
tems based on simulation results. Here however, Q obeys 
two clearly distinguishable behaviors: stochastic fluctua- 
tions when p < 1, and almost deterministic exponential 
growth when p > 1, which makes, in our view, the simu- 
lation results quite reliable. 

4.2 Simulations for the main results |2] and |3| 




log(l/(l - p)) 



Figure 2: ||Q(fc)|| vs. k for R = /3 = 2 and p = 0.99. 



Figure 4: Approximating q(2) for R — 2. 

From the results in Figure[2]we find by averaging over time 
Eoo(||Q(0)||) w 4700 for p = 0.99, corresponding in view of 



the definition ((5JI of the scaiing exponent to the estimate 
a(2) » log 4700/ log 100 « 1.84. To simplify the discussion 
define in the sequel 



F(p,P) 



logEoodlQWH) 
log(l/(l-p)) : 



p£ (0,oo],p< 1. (7) 



Then, the scaling exponent ot{0) is defined in ([5]) via a lim- 
iting procedure, namely a(/3) = F(l— , /?) = lim p -|-i F(p, 0). 
Using the results of Figure [5] to estimate a (2) amounts to 
using the approximation F(l— ,2) « F(0.99, 2). In order 
to check whether this approximation is valid, we performed 
the same simulation for different values of p. In Figure 0] 
the value of F(p, 2) is plotted for different values of p £ 
(0.87,0.999). F(p,2) is plotted versus log(l/(l - p)) in or- 
der to "dilate" time around the value p = 1 that we are 
interested in (/3a(/3) is plotted versus 1//? in Figure [6] for 
the same reason), and also because it is natural to regress 
F(p,2), as function of p, against log(l/(l — p)) (sec forth- 
coming discussion). 

Figure [4] shows that the limit F(l—, 2) seems to exist, but 
F(p, 2) is still significantly increasing for p = 0.999. Thus 
-F(0.999, 2), and in particular .F(0.99, 2), cannot be used as 
an estimate of a(2). It is numerically difficult to run a simu- 
lation for even higher values of p, and so to circumvent this 
problem we use the simulation results displayed in Figure [4] 
to find the asymptotic value of F(p, 2). To do so, we use the 
approximation to infer the form of F(p,j3), namely 



F(p,/3) «a(/J) + 



log C 



log(l/(l-p))' 



(8) 



which suggests, as mentioned above, to regress F(p, 2) against 
a + b/x in the scale x — log(l/(l — p)). We performed this 
regression for the curve displayed in Figure [4] and found an 
optimal value a — 1.9984 w a(2). 




- Estimated a((3) for R = 2 

- Estimated c*(/3) for R = 3 

- Estimated for R = 5 

■ 1/P 



Figure 5: a(f3) vs. P for R 6 {2,3,5}. 

Applying the same approach, we can find an estimate for 
a(/3) for any value of /?. The results for R — 2, 3 and 5 are 
given in Figure [5] and confirm that R does not seem to in- 
fluence a((3). Further, the approximation a(P) ss 2 appears 
to be very good for any ft > 1.2, namely, the estimated value 
of q(/3) is at most 3% away from 2 for any f3 > 1.2 and any 
R — 2, 3, 5, in very good agreement with the main result [2] 
In Section 15.2.41 we will discuss in greater detail the case 
where /3 > 1 is close to 1. 



Estimated Pa(0) for R = 2 
Estimated /3a (/3) for R = 3 
ated fia{P) for R = 5 




l/f> 



Figure 6: f3a(/3) vs. 1//3 for Re {2,3,5}. 



Finally, for small values of /3, we observe that ct(/3) is close 
to the lower bound 1//3. To see how close a(P) is to the lower 
bound, we made a plot for /3a(/3) in Figure [6] We observe 
that Pa(fl) G (0.98, 1.02) for any /3 < .45, which supports 
our claim made in the main result [3] and the heuristic argu- 
ment given in Section \3. 21 



5. ANALYSIS OF THE CASE p e (l, oo] 
5.1 The case p = +00 

We study here in detail the case P = +00. In Section \5. 21 
we explain why the case P > 1 can be seen as a perturbation 
of this case. In the remainder of this subsection we assume 
that P = +00, so that A(T*) = and T* = inf{fc > : 
A(fc) = 0}. 

5.1.1 Control of T* 

We first prove that T* w r max , i.e., the time at which 
the R independent random walks A r simultaneously hit 0, 
is close to the first time at which each process has visited 
at least once. 

Lemma 5.1. sup a p E a (T* — r max ) is finite, where the supre- 
mum is taken over a £ N fl and p < 1. 

Proof. By monotonicity of T* in p, it is enough to show 
that sup a E a (T* — T m ax) is finite for p = 1. So assume in 
the remainder of the proof p = 1: then each active queue is 
stable (for that we only need p < 2) and in particular, T* 
is almost surely finite. Thus the strong Markov property at 
time Tmax < T* gives 

E a (T* - r max ) = E a [E A{Tmax) (T*)] . 

When A r hits for the first time at time r r> we can couple 
it with a stationary version A r of Ai in such a way that 
A r (k) > A r (k) for every k > r r . By monotonicity and since 
queues are independent, we obtain 

E a (T* — T max ) < E e (T*), 

where for any r > 2, we define g — (X\ , . . . , Xr) £ N fl with 
(Xi,i > 1) i.i.d. with common distribution the stationary 
distribution X of A r . 

Define <r(0) = and for k > 1, r(fc) = r max o 6 tT ^ k _i-j 
and cr(fc) = a(k — 1) + r(k), where 6 is the shift oper- 
ator. In words, a(k + 1) is the smallest time after cr(fc) 
such that every queue will have visited at least once after 
(and including) time a(k). Note that if A(<r(fc)) = for 



some k, then o~(k') = cr(k) for k' > k. In particular, the 
limit cr(oo) as k — > +00 exists, and is by construction equal 
to T*. Since r(fc) = cr(fc) — er(fc — 1) this can be rewritten 

asT*=E fc >i^)- 

By construction, A r takes the value at least once be- 
tween a(k) and a(k + 1): at this time we can couple it 
with a stationary version of itself that stays above it. Fol- 
lowing this stationary process from this time until the time 
a(k + 1), this gives the existence of i.i.d. random variables 
(X r (k),r 6 R,i; > 0) with common distribution X such 
that A r (a(k)) < X r (k) for every r and k. 

Considering the time needed for every queue to visit 
at least once starting from (X r (k),r £ TV) gives an upper 
bound on r(k). Each time we do so, there is a probability 
p — Pp(A(r max ) = 0) £ (0, 1) that we reach at time 
T max . Thus, T* is (stochastically) upper bounded by t{ + 
■ ■ ■ +tq with G a geometric random variable with parameter 
1 — p and (t/) i.i.d., independent from G and with common 
distribution r max under P e . This implies 



E e (T*) <E(rf + --.+7&) 



■ (Tmax) 



, (A(r max ) = 0) ' 



Since r max < n H + tr, E ar (r r ) = a r /(l — E£) and X 

has finite mean (because £ is assumed to have finite second 
moment), E e (r max ) is finite and so the proof is complete. □ 

The previous lemma justifies the approximation T* ~ 
T max , and to further control r max , we will use the fact that 
(j r ,r £ TV) under P a is equal in distribution to (V r {a r ),r £ 
TV), where (V r ,r £ TV) are i.i.d. random walks started at 
and with step distribution 8, equal in distribution to ti un- 
der Pi (i.e., 8 is the time needed for a random walk with 
step distribution £ — 1 to go from 1 to 0). Then 8 has finite 
mean 1/(1 — E£) and also finite variance, which we denote 
by v. To control the maximum of random walks, we will use 
the following result. 

Lemma 5.2. If (W r ) are i.i.d. random walks started at 
with step distribution having mean m and variance w, then 
for any x = (x r ) £ N fl it holds that 



E I max Wr(x r ) ) < m|x| + Rw\x 



1/2 



PROOF. Defining Y r = (H / r(ir)-ma:r)/(™a:r) 1 ' 2 , wehave 
max r W r (x r ) — max r (mx r + wx^ 2 Y r ) so that 

max W r (x r ) < max (mx r + wx^ 2 \Y r \ ) < mlxl+uilxl 17,2 !! Y|| 

r r \ / 

which proves the result since E(||Y||) = ]T r E(|Y r |) < R. □ 

5.1.2 Proof of a(oo) = 2 

Before proving the main result [2] in the case /3 = +00, we 
first need to prove stability and finiteness of the stationary 
mean. 

Proposition 5.3. Assume that ft — +00 and that p < 1. 
Then Q is positive recurrent and E oo (||Q(0)||) < +00. 



PROOF. Let $(q) = |q a | + |q'|: to prove Proposition 15.31 
it is enough to prove that 

lim sup fWOp_lM) <0 . (9) 



Indeed, this shows that $ is a Lyapunov function, which im- 
plies positive recurrence of Q using for instance the Foster- 
Lyapunov criterion. But it shows more than that: in the 
terminology of 5 it implies that $ is a geometric Lyapunov 
function, and Theorem 5 in [5] shows that Q implies that 
Eoo[$(Q(0))], and in particular £00(^(0)!!), is finite. Thus 
we only have to prove {9). 

Since |x + y| < |x| + |y|, (J2j) implies that 



l [4>(Q(l))]<|q|+E q .(|S(T*)| 



■E c 



|A(T* 



Since S and T* are independent, Lemma 15.21 gives that 
E a (|S(T*)|) < E(C)E a (T*) + RvE a ({T') 1/2 ) for any a £ N fl 
(recall that v is the variance of £, assumed to be finite). Thus 
after rearranging the terms, we end up with the bound 

E q [$(Q(1)) - $(q)] < -(1 - p)E q »(T*) + *(q a ), (10) 

where 

*(a) = E a (|A(T*)| + (1 - EC)T* - |A(0)| + Rv(T*) 1/2 ^ 



We now argue that ^(a) < c[E a (T*)] 1 ^ 2 for some finite con- 
stant c independent of a. Since we are considering the case 
(3 = +00, we have |A(T*)| = and so we only have to show 
that (1 — E£)E a (T*) - |a| < c[E a (T*)] 1/2 . By Lemma 1531 
we have E a (T*) < E a (r max ) + c' for some finite constant c' 
independent of a. Further, since r max is equal in distribu- 
tion to max r V r (a r ), Lemma 15.21 gives (recall that v is the 
variance of the step distribution of the Vr's) 



E a (T*) < 



1 -E£ 



+ Rv\&\ 1/2 + c 



which implies the existence of the desired constant c such 
that *(a) < c[E a (T*)] 1/2 . Defining 



r(a) = (l-p)Ea(T*)-c|E. (T*)] 1/2 . 



(11) 



(ITOjl can be rewritten as E q [$(Q(1)) - $(q)] < -F(q a ). 
Using the Markov property and ([2]), this gives 

E q t*(Q(2)) - $(q)] 

<-E q a (r(q a )+r(q 1 + S(T*))). (12) 



When <3>(q) = |q a | + |q l | is large, at least one of the 2R 
coordinates of q must be large. Since E a (T*) > a r /(l — E£) 
(as a consequence of T* > r r and E a (r r ) = a/(l — E^)), it 
is not hard to show that 

. E q . (r(q a ) + r(q' + s(r*))) 

lim mf — > 0, 

K^ + oo q :S( q )>if $(q) 

which completes the proof of the result. □ 

Now that we have stability and finiteness of the stationary 
mean, we prove that a(oo) — 2. The proof will make use of 
the following result. 

Lemma 5.4. If (Wj) are J i.i.d. random walks started at 
with non-negative step distribution having mean m and vari- 
ance w, then for any x = ixj) £ N J it holds that, 



E 



m&xWjixj) 
j 



1/2 



> (m|x|) 1//2 — (w jm A> '^)|x 



3/2n 



-1/2 



Proof. Since E(ma% Wj(x 3 -)) > maxjE(Wj(xj)) it is 
enough to prove the result for J = 1. Fix k > and let 
y = (Wi(fc) - mfc)/(mfc) 1 / 2 and 

/(,)= 1+j//2 - 2 (1 + y)1/2 ,,>-l. 

y 

Since EY = and Y > — (km) 1 / 2 , we can rewrite after some 
algebra 

E(Wi(fc) 1/2 ) = (fcm) 1/2 - (fcm)- 1/2 E [r 2 /((fem)" 1/2 V)] , 

and since sup/ = 1/2 and E(Y 2 ) = w/m this gives the 
result. □ 

Theorem 5.5. If(3 = +oo, then 
< liminf [(1 - p) 2 Eoo (||Q(0)||)] 

< limsup [(1 - p) 2 Eoo (||Q(0)||)] < +oo. (13) 

7n particular, a(oo) = 2. 

Proof. Since /3 = +oo, we have 

Eoo(||Q(0)||) = EoodlA^II) = iZE^^iCO)), 

and so we only need to prove the following upper and lower 
bounds: 



limsup [(1 - p) 2 E oc (4i(0))l < +oo, (14) 
pfi 



and 



lim 

pt 



inf {(l-p)E oo [A 1 (0) 1/2 ]} > 0. (15) 

Proof of JHJ. Starting from @, using that E a (ri) = a/(l — 
E£), subtracting on both sides E(£)Eoo(ti) (for this precise 
operation we need the finiteness of the stationary first mo- 
ment, to avoid doing oo — oo) and dividing by E£, we end 
up with 



El 1 ' 



Eoo (Ai(0))=Eoo (T*-n). 



1-E£, 

Then, adding and subtracting r max in the right hand side, 
and using that E£ = p/2, we obtain 

ff P (l - p)Eoo (Ai(0)) = Eoo (r max - n) + Eoo (T* - r max ) , 



with g p = 4/(p(2 — p)). Thus in view of Lemma 15.11 to 
prove (|14[) we only have to show that 



hm sup 



E 



oo \ I max 



1/2 



< +00. 



(16) 



Applying Lemma 15.21 to r max under P a (equal in distri- 
bution to max r V r (a r )), we obtain, denoting temporarily 
fi = 1/(1 - E£), E a (r max - n) < p(\a\ - oi) + Rv\a\ 1/2 . 
Integrating over the stationary distribution of Q and using 
Jensen's inequality, we obtain 

Eoo (r max - n) < /iEoo (|A(0)| - Ai(0)) 

+ Ru[E ao (A!(0))] 1/2 . 
In particular, to prove (|16p it is enough to show that 



(17) 



,. , Eoo (|A(0)|-Ai(0)) 

limsup ( — ^ — — ; ,„ I < +oo. 



Pti V [Eoo(A!(0))] 1/2 



We have already shown in the proof of Proposition 15.31 that 
Eoo (|A(0)|) = Eoo (|S(T*)|) < E(0E oo (T*)+flu [E^T*)] 172 , 
and since Eoo(Ai(0)) = E(£)Eoo (T*) this gives 

Eoc. (|A(0)| - Ai(0)) < Jfc> pMAi(0))/E£] 1/2 . 
This proves (|17|l which completes the proof of (|14|) . 

Proo/ of (fig)). We have 

Eoo[(Qr(0)) 1/2 ] =E 00 [(S 1 (T*)) 1/2 ] >E OQ [(S' 1 (r max )) 1/2 ]. 

Applying Lemma f5.4l to 5*i (r max ) by using the independence 
between Si and r max , we obtain 

Eco[(Ql(0)) 1/2 ] > (E0 l/2 Eoo[(w) 1/2 ] 

cEqo j^(T max ) j 

for some finite constant c independent of p. Applying again 
Lemma 15.41 to r max we obtain 



Eco[(Ql(0)) 1/2 ] > (fim 1/2 ^(\Q a (0)\ 1/2 ) 

-cE oo (|Q a (0)|- 1/2 )-cE oo [(r max )- 1/2 ] . 

Subtracting (^Ef) 1/2 E oo (Q?(0) 1/2 ) on both sides we finally 
end up with 

h p (l - p)E oc [(Q a 1 (0)) 1/2 ] 

> (jjEQV'Eoa (|Q a (0)| 1/2 - (Q?(0)) 1/2 ) 

- cEoo (|Q a (0)r 1/2 ) - cEoo [(r max )~ 1/2 ] , 

with h p = (l-(p/(2-p)) 1/2 )/(l-p) ->• lasp->- 1. The two 
last terms of the previous lower bound vanish as p f 1. As 
for the first term, it is not hard based on ([2)| to show that the 
ii-dimensional vector ((Q?(0) + 1)" 1/2 (Q°(0) - Qf(0)),r S 
7?.) under converges weakly as p f 1, from which one 
readily deduces thanks to the continuous mapping theorem 
that |Q a (0)| 1/2 - (Q?(0)) 1/2 also converges weakly as pfl 
to a random variable which is not identically zero. Using 
Fatou's lemma, this implies that 



lim inf 1 

pti 



|Q a (0)| 1/2 -(Q?(0)) 1/2 ) >0, 
which concludes the proof. □ 

5.2 The case p > 1 

5.2.1 Main result and a perturbation argument 

In the previous subsection, we analyzed in detail the case 
f3 = oo and proved a(oo) = 2. On the other hand, the sim- 
ulation results, see Figure [5] strongly suggest that ot(0) = 2 
for any ft > 1, and the goal of this subsection is to explain 
this result. The key to understand the behavior of Q for 
/? > 1 is the following proposition. In the remainder of this 
subsection we assume for simplicity that p = 1, but all the 
results below could be shown to hold uniformly in p < 2 
(all we need is the active queues to be stable) which is what 
would be needed to use these results in a heavy-traffic set- 
ting where p f 1. In the sequel, we let T\ be the time at 
which A\ advertizes a release for the first time. 



Proposition 5.6. If /3 > 1, then (Ai(Ti),Ti - n) un- 
der P ai converges weakly as ax — > +00 to a non-degenerate 
random variable. 

By non-degenerate, we mean a random variable (X, Y) £ 
N x Z such that both X and Y are non-deterministic and 
almost surely finite (an explicit expression for the weak limit 
of (j4i(Ti),Ti — ri) is given in Lemma |5.9[) . Note that from 
Proposition [576J which is concerned with the behavior of one 
active queue, one can easily deduce the behavior of A(T*) 
and T* (the proof of the following result is only sketched in 
order to comply with page limitations). 

Corollary 5.7. Assume that /3 > 1 and consider any 
sequence of initial states a n = (a n ,r) such that min,- a n ,r — > 
+00: then (A(T*),T* — r max ) under P a „ converges weakly 
as n — >• +00 to a non-degenerate random variable. 

Proof (sketch). At time T* , each active queue needs 
to have advertized a release at least once. At this time, the 
active queue was of order one by Proposition 15.61 and since 
it is stable, it remains of order one at time T* . □ 

In the extreme case /3 — +00 we have by definition A(T*) = 
and (by Lemma 15.1)1 T* « r max . By showing that both 
A(T*) and T* — r max are of order one when ft > 1, Corol- 
lary 15.71 therefore justifies seeing the case ft > 1 as a per- 
turbation of the case f3 = +00. In particular, treating 
Eoo(A(T*)) and Eoo(T* — r m ax) as constants, the heuris- 
tic reasoning outlined in Section [3]3] goes through and again 
predicts a scaling exponent a(/3) — 2 for any f3 > 1. This is 
indeed in very good agreement with the simulation results 
discussed in Section [4]2] and was stated as our main result[2] 
The remainder of this subsection is devoted to the proof 
of Proposition 15.61 The proof proceeds in two steps: first 
we show that the proof reduces to proving , which con- 
cerns a property of some random walk drifting to +00 and 
conditioned on staying positive. Then we prove that this 
property holds when ft > 1. 

5.2.2 First step: Reduction to (fT8|) 

Since before time Ti, the active queue A\ does by defini- 
tion not advertize any release, it is enough to prove Proposi- 
tion l5.6l in the case £ = 0, which we assume in the remainder 
of this subsection. In particular, Ai is a random walk with 
step distribution £ — 1 reflected at 0. The reduction of the 
proof of Proposition 15. 61 to proving (|18|) relies on the follow- 
ing coupling of the processes A\ for all possible initial states 
a > 0. 

Let V and be two independent processes with the 
following distribution. Let V be a version of Ai under Pq, 
i.e., (y(k),k > 0) is a random walk started at 0, with step 
distribution £ — 1 and reflected at 0. 

Let be a random walk started at 0, with step distri- 
bution 1 — £ and conditioned on never visiting after time 0: 
since E(l— f) > this conditioning is well-defined. Let more- 
over «(a) = max{fc > : W^(k) = a} be the time of last 
visit to a 6 {0, 1, . . . , 00} (to be understood as n(a) = +00 
for a = +00), so that n(a) is almost surely finite if a is fi- 
nite. Let finally W% be the process stopped at /t(a), i.e., 
W2{k) = W t {k) if k < «(a) and Wj(fe) = VK f («(a)) = a if 
k > ft(a). 

Lemma 5.8. Extend A\ on Z by setting Ai(k) = Ai(0) 
for k < 0, and let A + = (Ai(fc - n),k > 0) and A' = 



{Ai(-(k-Ti)),k > 0). Then for any finite a > 0, (A + ,A~) 
under ¥ a is equal in distribution to (V,WJ). 

In particular, as a — > +00, (A + , A~ ) under P a converges 
weakly to (V, W t ). 

Proof. That A + is equal in distribution to V and is inde- 
pendent from A~ follows from the strong Markov property 
at time ri. That A' is equal in distribution to Wj comes 
from duality. The weak convergence result then follows from 
the fact that n(a) — > +00 almost surely as a — > +00, so that 
(V, Wl) -> (V, W^) almost surely asa-> +00. □ 

Essentially, this representation of Ai shifts the origin of 
time at t\: A + looks at Ai from time t\ forward in time, 
while A~ looks at A\ from t\ backward in time. Moreover, 
this representation couples all the processes A\ with different 
initial states on the same probability space, which yields a 
simple representation for the law of (A\(Ti),T\ — n). Let 
in the sequel Z — (Z(k),k G Z) be the following process 
(indexed by Z): Z{k) = V(k) if k > and Z{k) = W r {-k) 
if k < 0. The previous coupling immediately implies the 
following result. 

Lemma 5.9. Let (Uk, k £ Z) be i.i.d., uniformly distributed 
in [0, 1], independent from Z, and for < a < +00 and 
k£Z let 

Jo ifk<-n(a), 

L>a,k = < 

[ 1 {y fc <^(z(fc))} etse > 

and Ta = inf {k £ Z : D a> k = 1}. Then for any finite a > 
0, (j4i(Ti),Ti — n) under ¥ a is equal in distribution to 
(Z(Ta),Ta) and in particular, it converges weakly as a — > 
+00 to (Z(TZ),T£) (with Z(T£) = +00 ifTl = -ooj. 

PROOF. The equality in law between {A\(T\), Ti— Ti) and 
(Z(Ta),Ta) is clear from the construction, and not difficult 
(although a bit heavy in notation) to formalize. Moreover, 
T^ is by construction decreasing in a and so its limit as a — > 
+00 exists. It is not hard to show that its limit is exactly 
and by continuity we deduce that Z{T^ ) -¥ Z(T^) as 
a — > +00, which implies the result. □ 

Thus to prove Proposition 15.61 we only have to establish 
that \T^\ is (almost surely) finite. Since V is positive recur- 
rent and starts at 0, it is clear that min{fc > : Doo.fc = 1} 
is finite and so to prove that \T^\ is finite, we only have to 
demonstrate that inf {A: < : Doo.fc = 1} is finite. In other 
words, we have to prove that sup {k > : U- k < 4>{W\k))} 
is finite, which informally means that advertizes a re- 
lease only finitely many times. 

So in the sequel, we consider (Uk,k > 0) i.i.d. random 
variables, uniformly distributed on [0, 1] and independent of 
, and we define 

N = Y1 1 {U k <i,(W-f(k))} 
k>0 

as the number of times advertizes a release. The proof 
of Proposition 15. 61 will thus be complete if we can prove that 

P(N < +00) = 1. (18) 

5.2.3 Second step: Proof of (fT8|) 

We now assume that /? > 1 and we prove that P(iV > 
n) — > as n — > +00, which will prove (|18|) . By definition, 

P (N = 0) = P (u k > ip{W^(k)),k > 0) , 



and since and the Uk's are independent this gives 



P(7V = 0) =E 



Let a > 0: introducing 95(a) 

l(fft(lc)=o)i 



•log(l - ip(a)) and L T (a) 



JZfc^n l/wt/n-»\, the local time at level a, we obtain 



>(N = 0)=E 



exp ~y^y( a ) Lt ( a ) 



(19) 



Lemma 5.10. The quantity sup a>0 E(Z/ (a)) is finite. In 
particular, F(N = 0) > 0. 

Proof. Let W~ be a random walk with step distribution 
£ — 1, started at and independent from W^, and for fcsZ 
define W*(k) = W^fc) if k > and W*(fc) = W~(-Jfe) if 
fc < 0. Thus, defining L*(a) = ~}2 ksz l{w*(k)=a} we have 
the obvious inequality L'(a) < L*(a) and so we only have 
to prove that sup agz E(L*(a)) is finite. 

It is clear that L* stays the same if W is shifted in time, 
and that shifting L* in time amounts to shifting W* in space. 
Moreover, for any w G Z the process (W*(k) + w,k £ Z) 
shifted at the time of last visit to is equal in distribution 
to W* . Combining these facts, we see that L* is a station- 
ary sequence and in particular, sup agz E(L*(a)) = E(L*(0)). 
But by the strong Markov property, it is clear that L* (0) is 
a geometric random variable, in particular it has finite first 
moment. This proves the fmiteness of sup a E(£J(a)). 

As for P(iV > 0), we have 

E ^^(0)^(0) <supE(L t (a))^^(a), 

\a>0 J a a>0 

and since <p(a) ~ as a — > +00 and /3 > 1, the sum 
2~2 a ¥>(°) i s finite which implies, in view of the last display, 
that the random variable ^2 a ip(a)L'(a) is almost surely fi- 
nite. This proves P(iV = 0) > in view of (|19p and con- 
cludes the proof of the lemma. □ 

We now prove that P(iV > n) — > as n — > +00. Let 
W be a random walk with step distribution 1 — £ and I = 
infi,>i W(k). Then, is by definition equal in distribution 
to W under Po( ■ | I > 1) (where the subscript refers to the 
initial state of W). Let moreover B n be the time at which 
W advertizes a release for the nth time, so that 



J o (B n < +00, Bn+l = +00, / > 1) 



with b = l/Po(/ > 1). Writing the event {I > 1} as the 
union between the two events {infi<fe<B n W(k) > 1} and 
{inffe>_B„ W(k) > 1}, the strong Markov property at time 
B n entails 



F(N = n) = 6E ( p{W(B n ));B n < +00, ^in^ W(k) > 1 



with p(w) =f w (N = 0,1 > 1). Coupling W under ¥ w with 
a version of W under Po that stays below it, it is easy to see 
that p(w) is increasing in w and so 



¥(N = n)> bp(0)¥ (B n <+oo, ^inf^ W(k)>l 



This last lower bound is equal to p(0)¥o(B n < +00 I > 1) 
which by definition is equal to p(0)¥(N > n). Since p(0) = 
P(iV = 0)/b is > by Lemma dividing by p(0) leads to 



F(N > n) < 



b¥(N = n) 



Since P(JV = n) 
Proposition 15.61 



V{N = Q) ' 
0, this finally proves (|18|) and hence 



5.2.4 More on the case f} > 1, p w 1 

The simulation results in Section [4] show a rather fuzzy 
behavior of a(j3) for /3 close to 1. Indeed, the curves shown in 
Figure [5] are smooth for small and large values (say, f3 < 1/2 
and /3 > 1.2) but for /3 close to one it is difficult to obtain 
stable numerical results. Our goal here is to discuss potential 
interesting phenomena arising for /3 > 1 close to one. 

The shape of the function p n> F(p, 0) depicted in Fig- 
ure^ is typical for large values of /?, say /3 > 1.2. In partic- 
ular, this function is increasing which makes the regression 
of F against 1/ log (1/(1 — />)), such as in JSJ, reasonable. 
However, as /3 gets closer to one, the shape of this function 
changes. For instance, Figure [7] shows simulation results for 
F(p, 1.2) which are representative of F(p, 0) for small ft, say 
1 < ft < 1.2. Noticeably, the function F(p, 1.2) is not mono- 
tone in p and so the approximation ([8]) cannot be valid for 
every p. Rather, we find that F(p, 0) decreases and then 
increases, and that the regression against l/log(l/(l — p)) 
is only accurate past the minimum. 





F(p, 1.2) 








result 














■ 







> bp(0)P (B n < +OO, I > 1) . 



log(l/(l - p)) 

Figure 7: Approximating a(1.2) for R = 2. 

Regressing the curve obtained in Figure [7] past the min- 
imum leads to the approximation q(1.2) ~ 1.94, which is 
still very much in line with our theoretical result q(1.2) = 2. 
However, the point where the minimum of the function p n> 
F(p, 0) is attained, shifts to the right when f3 gets closer 
to 1, leaving us with less points against which to regress. 
In particular, for ft < 1.2 one would need to simulate the 
system at loads higher than 0.999 to get accurate results. 

We suspect that this numerical instability is caused by 
heavy tails phenomena that seem to appear for /3 < 2. More 
precisely, inspecting the proofs of Proposition 15.31 and The- 
orem 15.51 one sees that Corollary 15.71 is not strong enough 
for the proofs of the case ft = +00 to go through directly. 
Indeed, instead of controlling the behavior of A(T*) and T* 
in distribution, one needs (at least, with the proposed proof 
strategy) to control their mean behavior. Let us do a small 
computation: let be the random variable introduced in 
Lemma 15.91 which is the weak limit of T\ — n, and let B^ 



be the first time the process advertizes a release. Then 
E(|T^|;T^ < 0) = E(B t ;B t < +00) = ^ k¥(B r = k), 



and, as before, we have 



k)=E 



ip{W\k))\{{l-Tp(W\i))) 



which 



Thus for large k, we should have P(B^ = k) « 
suggests that _B^, and in particular \T^\, has infinite mean 
for < 2, although both random variables are almost surely 
finite for /? > 1. It is possible to use this result to show that 
the (almost surely finite) weak limits of A(T*) and T* — 
Tmax have infinite first moment. This potentially invalidates 
the back-of-the-envelope computations in Section[3]3] and so 
more care is needed. For instance, simulation experiments 
for /3 = 1.2 and R = 2 suggest that K ai (Ai(T*)) grows as 
a?' 4 as ai — > +00. 

6. CONCLUSION AND FUTURE WORK 

In this paper, we pointed out to a mean-reverting effect 
which explains the performance of "cautious" activation rules 
in queue-based schemes for distributed medium access con- 
trol, showing that delays grow like — p) as p f 1. This 
motivated us to investigate more aggressive schemes. Our 
main contribution lies in highlighting, in this regime, a new 
effect that we called a lingering effect. We provided a de- 
tailed study of this effect and analytically showed that it can 
make delays grow with 1/(1 — p) at a quadratic rate. 

We believe that our work opens many exciting directions 
for future research. Our results leave open the question of 
intermediate values of the aggressiveness parameter, where 
these two effects could potentially coexist and lead to better 
performance. Our analysis also suggests that a full rigorous 
proof of the main result [2] may be within reach. Finally, we 
restricted ourselves in this paper to a simple topology, and it 
would be interesting to understand the kind of topologies in 
which the lingering effect can rear its head. This effect being 
due to central limit theorem type arguments, we believe that 
it may play a role in a wide range of topologies. 
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